智能论文笔记

Differentially Private Federated Learning on Heterogeneous Data

Maxence Noble , Aurélien Bellet , Aymeric Dieuleveut

分类：机器学习

2021-11-17

联邦学习（FL）是大规模分布式学习的范例，它面临两个关键挑战：（i）从高度异构的用户数据和（ii）保护参与用户的隐私的高效培训。在这项工作中，我们提出了一种新颖的流动方法（DP-SCaffold）来通过将差异隐私（DP）约束结合到流行的脚手架算法中来解决这两个挑战。我们专注于有挑战性的环境，用户在没有任何可信中介的情况下与“诚实但奇怪的”服务器沟通，这需要确保隐私不仅可以访问最终模型的第三方，而且还要对服务器观察所有用户通信。使用DP理论的高级结果，我们建立了凸面和非凸面目标算法的融合。我们的分析清楚地突出了数据异质性下的隐私式折衷，并且当局部更新的数量和异质性水平增长时，展示了在最先进的算法DP-Fedivg上的DP-Scaffold的优越性。我们的数值结果证实了我们的分析，并表明DP-Scaffold在实践中提供了重大的收益。

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

A Comparison of New Swarm Task Allocation Algorithms in Unknown Environments with Varying Task Density

Grace Cai , Noble Harasha , Nancy Lynch

分类：机器人

2022-12-01

Task allocation is an important problem for robot swarms to solve, allowing agents to use reduce task completion time by performing tasks in a distributed fashion. Existing task allocation algorithms often assume prior knowledge of task location and demand or fail to consider the effects of the geometric distribution of tasks on the completion time and communication cost of the algorithms. In this paper, we examine an environment where agents must explore and discover tasks with positive demand and successfully assign themselves to complete all such tasks. We propose two new task allocation algorithms for initially unknown environments -- one based on N-site selection and the other on virtual pheromones. We analyze each algorithm separately and also evaluate the effectiveness of the two algorithms in dense vs. sparse task distributions. Compared to the Levy walk, which has been theorized to be optimal for foraging, our virtual pheromone inspired algorithm is much faster in sparse to medium task densities but is communication and agent intensive. Our site selection inspired algorithm also outperforms Levy walk in sparse task densities and is a less resource-intensive option than our virtual pheromone algorithm for this case. Because the performance of both algorithms relative to random walk is dependent on task density, our results shed light on how task density is important in choosing a task allocation algorithm in initially unknown environments.

translated by 谷歌翻译

Prototypical few-shot segmentation for cross-institution male pelvic structures with spatial registration

Yiwen Li , Yunguan Fu , Iani Gayo , Qianye Yang , Zhe Min , Shaheer Saeed , Wen Yan , Yipei Wang , J. Alison Noble , Mark Emberton

分类：计算机视觉

2022-09-12

在医学图像分析中需要进行几次学习的能力是对支持图像数据的有效利用，该数据被标记为对新类进行分类或细分新类，该任务否则需要更多的培训图像和专家注释。这项工作描述了一种完全3D原型的几种分段算法，因此，训练有素的网络可以有效地适应培训中缺乏的临床有趣结构，仅使用来自不同研究所的几个标记图像。首先，为了弥补机构在新型类别的情节适应中的广泛认识的空间变异性，新型的空间注册机制被整合到原型学习中，由分割头和空间对齐模块组成。其次，为了帮助训练观察到的不完美比对，提出了支持掩模调节模块，以进一步利用支持图像中可用的注释。使用589个骨盆T2加权MR图像的数据集分割了八个对介入计划的解剖结构的应用，该实验是针对介入八个机构的八个解剖结构的应用。结果证明了3D公式中的每种，空间登记和支持掩模条件的功效，所有这些条件都独立或集体地做出了积极的贡献。与先前提出的2D替代方案相比，不管支持数据来自相同还是不同的机构，都具有统计学意义的少量分割性能。

translated by 谷歌翻译

Anatomy-Aware Contrastive Representation Learning for Fetal Ultrasound

Zeyu Fu , Jianbo Jiao , Robail Yasrab , Lior Drukker , Aris T. Papageorghiou , J. Alison Noble

分类：计算机视觉

2022-08-22

自我监督的对比表示学习提供了从未标记的医学数据集中学习有意义的视觉表示的优势，以进行转移学习。但是，将当前的对比度学习方法应用于医疗数据而不考虑其特定区域的解剖学特征可能会导致视觉表示，这些视觉表示在外观和语义上是不一致的。在本文中，我们建议通过解剖学对比度学习（AWCL）改善医学图像的视觉表示，该学习结合了解剖学信息，以以对比度学习方式增强正/阴性对采样。为自动化的胎儿超声成像任务展示了所提出的方法，从而使从解剖学上相似的相同或不同的超声扫描实现了正对，这些扫描在解剖学上相似，可以将其拉在一起，从而改善了表示的学习。我们从经验上研究了与粗粒和细粒度的粒度纳入解剖信息的效果，以进行对比学习，并发现使用细粒度的解剖学信息的学习能够保留阶层内差异比其对应物更有效。我们还分析了解剖比对我们的AWCL框架的影响，发现使用更独特但解剖学上的样品构成阳性对的影响会带来更好的质量表示。大规模胎儿超声数据集的实验表明，我们的方法对学习表征有效，可以很好地转移到三个临床下游任务，并且与受监督的Imagenet和当前的先进对比度学习方法相比，取得了优越的性能。特别是，在跨域分割任务上，AWCL的表现优于Imagenet监督方法，高于13.8％，基于最先进的对比度方法的方法为7.1％。

translated by 谷歌翻译

Multimodal-GuideNet: Gaze-Probe Bidirectional Guidance in Obstetric Ultrasound Scanning

Qianhui Men , Clare Teng , Lior Drukker , Aris T. Papageorghiou , J. Alison Noble

分类：计算机视觉

2022-07-26

眼睛跟踪器可以在超声（US）扫描期间为超声检查员提供视觉指导。对于经验丰富的运营商来说，这种指导可能是有价值的，可以提高他们在操纵探测器以实现所需飞机方面的扫描技能。在本文中，提出了一种多模式的指导方法（多模式形式的指导方法）来捕获现实世界中的视频信号，同步注视和统一框架内的探测运动之间的逐步依赖性。为了了解目光运动与探测运动之间的因果关系，我们的模型利用多任务学习共同学习了两个相关任务：预测经验丰富的超声仪将在常规产科扫描中执行的凝视运动和探测信号。这两个任务通过模态感知的空间图关联，以检测多模式输入之间的共发生并共享有用的跨模式信息。多模式形式的扫描路径不是确定性的扫描路径，可以通过估计实际扫描的概率分布来扫描多样性。通过三个典型的产科扫描检查进行的实验表明，新方法在探针运动指导和凝视运动预测方面都优于单任务学习。多模态偏见还提供了一个视觉引导信号，对于224x288 US图像，错误率小于10像素。

translated by 谷歌翻译

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava , Abhinav Rastogi , Abhishek Rao , Abu Awal Md Shoeb , Abubakar Abid , Adam Fisch , Adam R. Brown , Adam Santoro , Aditya Gupta , Adrià Garriga-Alonso

分类：自然语言处理 | 人工智能 | 机器学习 | (统计)机器学习

2022-06-09

语言模型既展示了定量的改进，又展示了新的定性功能，随着规模的增加。尽管它们具有潜在的变革性影响，但这些新能力的特征却很差。为了为未来的研究提供信息，为破坏性的新模型能力做准备，并改善社会有害的效果，至关重要的是，我们必须了解目前和近乎未来的能力和语言模型的局限性。为了应对这一挑战，我们介绍了超越模仿游戏基准（Big Bench）。 Big Bench目前由204个任务组成，由132家机构的442位作者贡献。任务主题是多样的，从语言学，儿童发展，数学，常识性推理，生物学，物理学，社会偏见，软件开发等等。 Big-Bench专注于被认为超出当前语言模型的功能的任务。我们评估了OpenAI的GPT型号，Google内部密集变压器体系结构和大型基础上的开关稀疏变压器的行为，跨越了数百万到数十亿个参数。此外，一个人类专家评估者团队执行了所有任务，以提供强大的基准。研究结果包括：模型性能和校准都随规模改善，但绝对的术语（以及与评估者的性能相比）；在模型类中的性能非常相似，尽管带有稀疏性。逐渐和预测的任务通常涉及大量知识或记忆成分，而在临界规模上表现出“突破性”行为的任务通常涉及多个步骤或组成部分或脆性指标；社交偏见通常会随着含糊不清的环境而随着规模而增加，但这可以通过提示来改善。

translated by 谷歌翻译

TempAMLSI : Temporal Action Model Learning based on Grammar Induction

Maxence Grand , Damien Pellier , Humbert Fiorino

分类：人工智能

2021-12-08

手动编码PDDL域通常被认为是困难，乏味的和容易出错的。当必须编码时间域时，难度更大。实际上，行动持续时间，它们的效果不是瞬间的。在本文中，我们提出了一种基于AMLSI方法的算法，该算法能够学习时间域。Tempamlsi基于在时间规划中完成的经典假设，即可以将非时间域转换为时间域。Tempamlsi是第一种能够使用单个硬信封和库欣的间隔学习时间域的方法。我们通过实验显示Tempamlsi能够学习准确的时间域，即可以直接用于解决新规划问题的时间域，具有不同形式的动作并发。

translated by 谷歌翻译